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Abstract 

A measurement of the ratio of the inclusive 3-jet to 2-jet cross sections as a function of 
the total jet transverse momentum, Hi, in the range 0.2 < Hj < 2.5 TeV is presented. 
The data have been collected at a proton-proton centre-of-mass energy of 7 TeV with 
the CMS detector at the LHC, and correspond to an integrated luminosity of 36 pb _1 . 
Comparisons are made between the data and the predictions of different QCD-based 
Monte Carlo models for multijet production. All models considered in this study are 
consistent with the data for Hi > 0.5 TeV. This measurement extends to an Hi range 
that has not been explored before. 
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1 Introduction 

In leading-order (LO) perturbative Quantum Chromodynamics (pQCD), jet production in pp 
collisions occurs when two partons interact via the strong force to produce two final-state par- 
tons. The elementary processes that contribute in the final state are qq — > qq, qg — > qg, gg — > gg, 
and gg — > qq, where q and g are quarks (or antiquarks) and gluons, respectively. Each of the 
final state particles may subsequently lose energy by emitting other quarks and gluons in a 
process referred to as a parton shower (PS). Finally the products of the parton shower un- 
dergo hadronisation and form hadron jets. Events with three or more jets in the final state 
originate from hard-gluon radiation and other higher-order QCD processes. The jet transverse 
momentum (pj), azimuthal angle in the plane perpendicular to the beams ((p), and rapidity 
(y = gln[[E + p z ]/[E — p z ]], where E is the jet energy and p z is the momentum component 
in the direction of the colliding protons) are strongly correlated with those of the final-state 
partons. For this reason jet measurements have been used to test pQCD HHU. 

This letter presents a measurement of the inclusive 3-jet to 2-jet cross section ratio, R32, in 
proton-proton collisions at = 7 TeV. The measurement is performed using a multijet 
data sample collected in 2010 by the Compact Muon Solenoid (CMS) experiment, at the Large 
Hadron Collider (LHC), and corresponds to an integrated luminosity of 36 pb -1 . Jets are re- 
constructed in the CMS detector using the anti-fey- jet algorithm [|9l [T0l , with the requirements 
pj > 50 GeV and rapidity \y\ < 2.5. This cross section definition is chosen to maximise the 
phase space of the measurement. The value of R32 is measured as a function of the total jet 
transverse momentum of the event, Hj, defined as 

H T = £>T f , 

;=1 

where px; are the transverse momenta of all jets found with pj > 50 GeV and \y\ < 2.5 in an 
event. This definition provides a general estimate of the energy available to the parton-parton 
hard process and is less sensitive to detector effects than pj. Major systematic uncertainties 
that hamper jet cross section measurements, such as those due to the jet energy scale and the 
jet selection efficiency, largely cancel in the measurement of R32, whilst the uncertainty on the 
integrated luminosity measurement cancels completely. Hence, a measurement of R32 is subject 
to smaller uncertainties than the absolute multijet cross section measurements and can provide 
a stringent test of the theoretical models. 

Here the measured R32 is used to perform a study on the validity of different Monte Carlo (MC) 
models currently in use for modeling collisions at the LHC. The high centre-of-mass energy of 
the LHC allows a test of the models in a transverse momentum region that was not accessible 
to previous experiments. Such studies are important because QCD-induced processes consti- 
tute, in some cases, the dominant backgrounds for signals of new physics at the LHC. This 
investigation is complementary to other studies which also aim to probe the validity of pQCD 
early in the LHC era EHS). 

The DO Collaboration at the Fermilab Tevatron has measured the same ratio of cross sections 
for jets in the range 100 < Hj < 600 GeV and pp centre-of-mass energy of 1.8 TeV 0. Hence 
this study extends the range of the R32 measurement in a previously unexplored transverse 
momentum region, and probes pQCD up to Hj = 2.5 TeV. 
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3 Data Selection and Reconstruction 



2 The CMS Detector 

The central feature of the CMS apparatus is a superconducting solenoid which provides an 
axial magnetic field of 3.8 T. The field volume of the solenoid is instrumented with various 
particle detection systems. Charged particle trajectories are measured by the silicon pixel 
and strip trackers, with full azimuthal coverage within \tj\ < 2.5, where the pseudorapid- 
ity t] is defined as \] = —In [tan (6/2)], and 9 is the polar angle. The momentum resolution 
is about 1% at 100 GeV. Surrounding the trackers are a lead-tungstate crystal electromag- 
netic calorimeter (ECAL) [14] with a preshower detector in the endcaps, which covers the re- 
gion \rj | < 3, and a brass /scintillator hadron calorimeter (HCAL) fl5 [l , which extends up to 
= 5. The ECAL has an energy resolution of better than 0.5% for unconverted photons 
with transverse energies above 100 GeV. The energy of charged pions and other quasi-stable 
hadrons can be measured with the calorimeters (ECAL and HCAL combined) with a resolu- 
tion ofAE/E 100 % / y/E [GeV] 5 %, albeit with a nonlinear response. For charged hadrons, 
the calorimeter resolution improves on the tracker momentum resolution only for pj's in ex- 
cess of 500 GeV. The steel return yoke outside the solenoid is instrumented with gas-ionisation 
detectors used to identify and reconstruct muons. The CMS experiment collects data using a 
two-level trigger system: the first-level hardware trigger (LI) [16] and the high-level software 
trigger (HLT) [17]. The LI jet algorithm searches for collimated transverse energy depositions 
using an effective sliding- window algorithm whose size is 1.044 by 1.044 in (rj, cp) space. Each 
jet candidate is assigned rj, (p, and pj values that are used to seed subsequent searches in the 
HLT. The HLT uses data at full granularity and searches for jets using an iterative cone algo- 
rithm HHJ EH of radius R = y 7 (A?/) 2 + (A</>) 2 = 0.5. A more detailed description of the CMS 
detector can be found in Ref . [20] . 

3 Data Selection and Reconstruction 

The jet sample used for this study was collected using the LI and HLT single jet triggers. The 
efficiency of the trigger system is studied using a combination of minimum-bias triggers [|2"T|, 
which are independent of the LI jet trigger, as well as low-threshold calorimeter triggers. Five 
different jet-trigger thresholds are used in this analysis, which ensure 100% trigger efficiency 
for each of the different Hj regions of the jet sample (H T > 0.20, 0.31, 0.40, 0.56, and 0.80 TeV). 
The integrated luminosities corresponding to each of the five regions are 0.4, 4.5, 9.2, 20.0, and 
36.0 pb _1 , respectively. 

Each event is required to have an offline reconstructed vertex position along the beam line 
that is within 24 cm of the nominal interaction point. The global event reconstruction (also 
called particle-flow event reconstruction [22]) consists in reconstructing and identifying each 
single particle with an optimised combination of all sub-detector information. The energy of 
photons is directly obtained from the ECAL measurement, corrected for zero-suppression ef- 
fects. The energy of electrons is determined from a combination of the track momentum at 
the main interaction vertex, the corresponding ECAL cluster energy, and the energy sum of 
all bremsstrahlung photons attached to the track. The energy of muons is obtained from the 
corresponding track momentum. The energy of charged hadrons is determined from a combi- 
nation of the track momentum and the corresponding ECAL and HCAL energy, corrected for 
zero-suppression effects, and calibrated for the non-linear response of the calorimeters. Finally 
the energy of neutral hadrons is obtained from the corresponding calibrated ECAL and HCAL 
energy. 

For each event, hadronic jets are clustered from these reconstructed particles with the infrared 
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and collinear safe anti-Air algorithm, operated with a size parameter R of 0.5. The jet momen- 
tum is determined as the vectorial sum of all particle momenta in this jet, and is found in the 
simulation to be within 5% to 10% of the true momentum over the whole pj spectrum and de- 
tector acceptance. An offset correction is applied to take into account the extra energy clustered 
in jets due to additional proton-proton interactions within the same bunch crossing. Jet energy 
corrections are derived from the simulation, and are confirmed with in situ measurements with 
the energy balance of dijet and photon+jet events [23 1 Additional selection criteria are applied 
to each event to remove spurious jet-like features originating from isolated noise patterns in 
certain HCAL regions. 

Events with two or more jets of Hj > 0.2 TeV, pj > 50 GeV, and \y\ < 2.5 are selected. The 
final dijet sample consists of 376 590 events. A subsample of 206 845 events has an additional 
third jet that passes the pj and y selection criteria. 

4 Monte Carlo Models and Comparisons with Data 

The jet data are compared with the predictions of five QCD-based MC generators that simulate 
jet production in pp collisions at a/s = 7 TeV. 

The PYTHIA 6.4.22 MC generator gU uses LO matrix elements to generate the 2 — > 2 hard 
processes and a PS model to simulate higher-order processes. The PS model gives a good 
description of parton emission when the emitted partons are close in phase space. Events are 
generated with two different tunes of Pythia 6.4: D6T 11251 and Z2. The Z2 tune is identical to 
the Zl tune described in [25] except that Z2 uses the CTEQ6L1 11261 proton parton distribution 
functions (PDF) while Zl uses CTEQ5L ||27| . Parton showers are ordered by mass in D6T and 
by pj i n Z2. Hadronisation is simulated using the Lund string model I281 I29II . 

The Pythia 8 [301 tune C2 |3T1 simulation orders the PS in pj and models the underlying event 
using the multiple parton interaction model from PYTHIA6, including initial- and final-state 
QCD radiation. 

The HERWIG++ 1 32 1 tune 2.3 program takes LO matrix elements and simulates PS using the 
coherent-branching algorithm, with angular ordering of showers. Hadronisation is simulated 
using the cluster model. The underlying event is generated from the eikonal multiple parton- 
parton scattering model. 

The MadGraph |33ll34| generator is used because of its treatment of hard and well-separated 
partons. MadGraph generates tree-level helicity amplitudes and is interfaced to PYTHIA6, 
which generates the rest of the higher-order effects using the PS model. Matching algorithms 
ensure that no double-counting occurs between the tree-level and the PS-model-generated par- 
tons. The MadGraph parameter that distinguishes between PS and tree-level diagrams is set 
to 30 GeV. The tune version is D6T. 

The Alpgen [35] tune D6T MC program also generates tree-level processes. As with Mad- 
Graph, it is interfaced with PYTHIA6 for generating higher-order effects. The jet-parton match- 
ing threshold is set to 20 GeV and the maximum distance between partons and jets to AR = 0.7. 

The CTEQ6L1 PDF set is used to model the proton structure in Pythia 8 and MadGraph, 
while HERWIG++ and Alpgen use the MRST2001 |36l and CTEQ5 PDF sets respectively. 

The generated events from PYTHIA6 tune Z2, MadGraph, and HERWIG++ are processed 
through a full detector simulation based on GEANT4 [37|. 
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5 Extraction of K 32 from the Data 



Jets (denoted MC jets) are reconstructed from particle-flow objects by means of the anti-fcj jet 
algorithm. The jet energy corrections and selection criteria applied to MC events are identical 
to those applied to the data. Jets are also reconstructed using the anti-/cr jet algorithm on the 
MC four-vectors of the stable particles after hadronisation. These (hadron-level) jets, referred 
to as Genjets, represent the MC prediction free of detector effects. 

Detailed studies of the pj, y, and Hj reconstruction are performed using the MC simulation. 
The pj resolution varies from 12% at 50 GeV to 5% at ITeV. These values are in agreement with 
the resolutions extracted from data. The Hj resolution found using MC jets varies from 6% at 
200 GeV to 3.5% at 2.5 TeV. 

Plotted in Fig. [I] (top) are the Hj distributions of the inclusive 2-jet (left) and 3-jet (right) sam- 
ples after all selection criteria (solid circles). The five different jet samples are normalised to 
the integrated luminosity of 36 pb -1 . The measurements are compared to the predictions of 
PYTHIA6 tune Z2 MC (histogram), normalised to the total number of dijet events. Also shown 
in Fig. [T] (bottom) is the ratio between the MC predictions and the data, separately for inclu- 
sive 2-jet and 3-jet events. The MC predictions describe within 20% the shape of the measured 
Hj distributions in data.This test has been repeated using the predictions of MADGRAPH and 
HERWIG++ MC programs with similar results. 



CMS 

Vs= 7 TeV 
anti-K, R=0.5 
> 2 jets 

Data (L =36 pb 1 ) 
PYTHFA6 tune Z2 




2.5 
H T (TeV) 




2.5 
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Figure 1: Hj distributions (top) for inclusive 2-jet (left) and 3-jet (right) events for data (solid 
circles) and Pythia6 tune Z2 MC generator (histogram). Ratios between the MC predictions 
and the data (bottom). 



5 Extraction of K 32 from the Data 

The inclusive 2-jet {dtJil 'dHj) and 3-jet (dcr^/dHj) differential cross sections are extracted from 
the data using 

dHj Lei AHj ' ( ' 
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where Q, £,, and N, are the smearing correction, detection efficiency, and the number of selected 
events for inclusive 2-jet (z = 2) and inclusive 3-jet (i = 3) events respectively, L is the integrated 
luminosity, and AHj is the bin size. 

The smearing and efficiency corrections are computed using the PYTHIA6 tune Z2: Genjet 
events, consisting of all hadron-level events which have at least two jets with pj > 50 GeV 
and |y| < 2.5, and MC jet events, consisting of all events with at least two reconstructed jets 
passing all selection criteria. The efficiency £, is defined as the number of selected events (at 
any reconstructed Hj) arising from a given generated Hj interval divided by the total number 
of generated events in that interval. The efficiency for 2-jet events is 100% throughout the Hj 
range, while the efficiency for 3-jet events increases from 72% at Hj = 200 GeV to 100% above 
Hj = 400 GeV. The smearing correction Q is defined as the number of selected events (at any 
reconstructed Hj) arising from a given generated Hj interval divided by the total number of 
selected events in that interval. The smearing correction C2 is approximately 0.9 throughout 
the Hj range, while C3 is approximately 0.7 at Hj = 200 GeV and increases to 0.9 for Hj > 300 
GeV. 

The ratio R32 as a function of Hj is extracted from the data by dividing the inclusive differential 
3-jet cross section by the inclusive differential 2-jet cross section: 



_ da 3 /dHj 

The overall multiplicative correction factor to the data, C3/C2 x £2/ £3, is within 4% of unity in 
the range 0.2 < Hj < 0.5 TeV and within 2% thereafter. This method of extracting R32 is tested 
by treating the MadGraph sample as the data and using PYTHIA6 as the MC. The M adGraph 
MC jets are treated and corrected (using Pythia6) the same way as the data and the extracted 
ratio from MADGRAPH is compared with the ratio at the hadron-level from MADGRAPH using 
Genjets. The extracted ratio is within ±1% of the Genjet ratio, and this difference is included 
in the overall systematic uncertainty for R32. 

The measured R32 after all corrections is shown in Fig. |2j As expected, R32 rises with increasing 
Hj, as the phase space opens up for the emission of a third jet satisfying the pj > 50 GeV 
selection criterion. The R32 value reaches a plateau of about 0.8, which depends upon the pj 
and y selection criteria as well as the jet-finding algorithm. The R32 plateau value measured 
here is different from that measured by the DO collaboration. The origin of this difference was 
studied using the MC and it is due to the different centre-of-mass energy and pj and y selection 
criteria. 

Several sources of systematic uncertainty on R32 are considered. Uncertainties due to insuf- 
ficient knowledge of the MC Hj distributions are estimated by altering the shape of the MC 
distributions and comparing with the data. Those uncertainties are in the range 3-8%. Un- 
certainties originating from the jet energy scale, the effect of pile-up, insufficient knowledge of 
the jet pj resolution, and self-consistency checks of the extraction method (using MadGraph 
as data) are considered. The contribution of each of these effects is at the 1% level. The com- 
bined systematic uncertainty is estimated by adding in quadrature all these uncertainties and 
is shown as the shaded area in Fig. [2] The combined systematic uncertainty is in the range 
4-10%. 

Also displayed in Fig.|2]are the predictions of the five MC generators considered in this study. 
The ratios, R32 , of the different MC predictions to the measured values are plotted in Fig. [3] The 
shaded area represents the combined statistical and systematic uncertainty for the data. The 
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5 Extraction of K 32 from the Data 
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Figure 2: The measured R32 (solid circles) as a function of Hj, and the predictions of PYTHIA6, 
PYTHIA8, MADGRAPH, ALPGEN, and HERWIG++ (curves). Error bars represent statistical un- 
certainties. The shaded area indicates the size of the combined systematic uncertainty. 
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Figure 3: The ratios of the predicted R32 values from the PYTHIA6, Pythia8, MadGraph, 
Alpgen, and HERWIG++ Monte Carlo generators to the measured value, as a function of Hj. 
The shaded area indicates the size of the combined statistical and systematic uncertainty. 



predictions of MadGraph agree with the data throughout the Hj range of the measurement. 
On the other hand, previous studies of event shapes and dijet angular decorrelations ITTl |12| 
indicate that MadGraph does not describe these distributions well. However, R32 is con- 
structed from inclusive measurements and is mainly sensitive to the probability of emitting a 
third parton in the final state, whereas the other two distributions depend mainly on the final 
state of the event. In contrast, the predictions of ALPGEN, PYTHIA6, PYTHIA8, and HERWIG++ 
are in agreement with the measured R32 for Hj > 0.5 TeV, although they overestimate it for 
lower values of Hj. 



6 Summary 

The ratio of the inclusive 3-jet to 2-jet cross sections, for jets with pj > 50 GeV and \y\ < 2.5, 
has been measured in the range 0.2 < Hj < 2.5 TeV for proton-proton collisions at a centre-of- 
mass energy of 7 TeV. The measured ratio rises with increasing Hj, as the phase space opens 
for the production of a third jet, reaching a plateau value of about 0.8 for Hj > 1 TeV. 

This study tests the prediction of the different MC generators considered at TeV scales. PYTHIA6 
tune Z2, MadGraph, and HERWIG++ describe within 20% the shape of the H T distributions 
in data. 

The predictions of MadGraph, which generates tree-level helicity amplitudes, are in agree- 
ment with the measured R32 throughout the range of this measurement. ALPGEN, which also 
uses this method, describes well the data for Hj > 0.5 TeV, but overestimates R32 for lower Hj 
values. The difference between the predictions of MadGraph and Alpgen is an estimate of 
the uncertainty of the theoretical predictions due to the different jet-parton matching parame- 
ters used by the two MC programs. The current results combined with previous measurements 
of event shapes and dijet angular decorrelations indicate that whilst MadGraph has difficul- 
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6 Summary 



ties in describing the details of the event shapes and dijet angular decorrelations, it predicts 
correctly the probability of emitting a third parton in the final state. The MC programs that use 
LO matrix elements to simulate the hard scattering and PS for all higher-order effects (PYTHIA6, 
PYTHIA8, HERWIG++) describe well the data above H T > 0.5 TeV, but overestimate R 32 below 
this value. This is the first measurement of this ratio in the multi-TeV region, extending to an 
Hj range that has not been explored before. 
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